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Claim 

A source program compression method characterized by an encoding processing means 
and a storage means with areas for recording pre-stored reserved symbols in table form and 
variable symbols in table form, wherein the aforementioned encoding processing means 
categorizes the words constituting the source program into a group of reserved words for 
operators, a group of numerical value constants, a group of character string constants, and a 
group of variables, assigns corresponding word numbers to the words contained in the operator 
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reserved word group with reference to the aforementioned reserved symbol table, assigns 
corresponding word numbers to the words and encoded numerical values to the words contained 
in the numerical value constant group, assigns corresponding word numbers, numerical values 
indicating the lengths of the character strings, and the words for said character strings to the 
words contained in the character string constant group, records the words contained in the 
variable group in the variable symbol table, and adds the corresponding word numbers and the 
numbers indicating the locations they are registered in said symbol table to them for output. 

Detailed explanation of the invention 
Industrial application field 

The present invention pertains to a source program compression method used for 
converting a source program into intermediate codes in an interpreter, such as a BASIC 
interpreter, widely utilized for a personal computer. 

Prior art 

Conventional interpreters, represented by the BASIC interpreter, e.g., include those in 
which the source program is interpreted directly with almost no conversion and in which it is 
interpreted after the data are converted into intermediate code for compression. However, a 
method in which only some of the operators, reserved words, and numerical value constants were 
encoded, and the other character string constants and variables were rewritten into intermediate 
code while they remained as character code used for external expression was used as the 
conventional intermediate encoding method. 

Problems to be solved by the invention 

In this conventional method, including not only the method in which there was almost no 
conversion of the source program but also the method utilizing intermediate encoding for 
compression, because while some of the operators, reserved words, and numerical value 
constants were encoded and the rest of the data remained as character codes for external 
expression, the compression rate relative to the data size of the source program was low. 

In addition, although intermediate encoding was applied, because some portion remained 
as character codes for external expression, interpretation was to be carried out simultaneously 
with word analysis in order to execute a program of said type. 

For example, assuming that up to 8 characters were allowed as the character length for a 
variable name, the word analysis required reading of character codes for up to 8 times, which 
imposed a restriction on the improvement of the execution speed. In addition, even where the 
processing was branched, as designated by a label such as the "goto*label," the label name was 
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rewritten by using the character codes for external expression as they were. Thus, it had to be 
interpreted by reading the character codes one at a time. Furthermore, when the destination of the 
jump indicated by the "goto*label" was located ahead of the hne being executed, there was only 
the possibility of skipping the intermediate code character by character until the "goto*laber' was 
encountered, so that a drop in the execution speed was inevitable. 

In light of such conventional problems, the present invention presents a source program 
compression method with which the compression rate of intermediate codes can be improved in 
order specifically to improve the processing speed. 

Means to solve the problems 

The means for solving the aforementioned problems is a source program compression 
method characterized by an encoding processing means and a storage means with areas for 
recording a table of preregistered reserved symbols and a table of variable symbols, wherein the 
aforementioned encoding processing means categorizes the words constituting the source 
program into a group of reserved words for operators, a group of numerical value constants, a 
group of character string constants, and a group of variables, assigns corresponding word 
numbers to the words contained in the operator reserved word group with reference to the 
aforementioned reserved symbol table, assigns corresponding word numbers to the words and 
encoded numerical values to the words contained in the numerical value constant group, assigns 
corresponding word numbers, numerical values indicating the lengths of the character strings, 
and the words for said character strings to the words contained in the character string constant 
table, registers the words contained in the variable group in the variable symbol table, and adds 
the corresponding word numbers and the numbers indicating the locations they are recorded in 
said symbol table to them for output. 

Application example 

The present invention will be explained in detail on the basis of an application example. 
Figure 1 is a block diagram illustrating the concept of the present invention. Figure 2 is a flow 
chart showing an example in which the encoding processing means in Figure 1 is realized using a 
program. Figure 3 is an example of the source program. Figure 4 shows an example in which said 
source program is converted into intermediate code according to the aforementioned application 
example. In Figure 1, encoding processing means 2 obtains words one after another from source 
program 1, categorizes them, generates corresponding word numbers with reference to reserved 
symbol table 3 that is a table showing the correspondence between words and word numbers, and 
outputs them as intermediate code 5. That is, in Figure 2, encoding processing means 2 operates 
as follows. First, it obtains words one after another. Then, 




4 

1) when the word is a line number, it outputs its word number and the line number, 

2) when the operator is a reserved word, it outputs each corresponding word number, 

3) when the word is a numerical value constant, it outputs its word number and the 
numerical value converted into an internal expression, 

4) when the word is a character string constant, it outputs its word number, character 
length, and character string, and 

5) it records the word as a variable into the variable symbol table when it is not subsumed 
under aforementioned 1) through 4) and outputs its word number and the recorded position 
number. 

According to the aforementioned operation, when the source program in Figure 3 is input 
in this example, because the number 100 at the beginning of the line is a line number, the 
corresponding word number (= 1) and line number (= 100) are output. Because the next SUM is 
not a line number, an operator, a reserved word, a numerical value constant, or a character string 
constant, it is recorded in the variable symbol table, and the word number (= 10) and registration 
location number (= 1) are output. Because next = is an operator, the corresponding word number 
(= 7) is output. The same operation is repeated thereafter in order to create and output the table 
shown in Figure 4 comprising intermediate codes and variable symbol table. A label is treated as 
a variable, recorded in the variable symbol table, and assigned with the word number and the 
registration location number corresponding to the variable. 

Effects of the invention 

As described above, the source program compression method of the present invention 
offers many advantages in that a high compression rate can be achieved because not only 
operators, reserved words, and numerical value constants but also variables and character string 
constants can be encoded. 

Furthermore, all words, including variables (including labels), are assigned with word 
numbers, and the meanings of intermediate codes are interpreted immediately according to their 
word numbers. In particular, unlike in the prior art, the trouble of analyzing a word character by 
character in order to execute a label is no longer required, its contents can be understood 
immediately using the word number and the variable symbol table, so that the execution speed 
can be increased. 

Brief description of the figures 

Figure 1 is a block diagram showing the concept of the present invention. Figure 2 is a 
flow chart showing an example of the encoding processing means. Figure 3 is an example of the 
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source program. Figure 4 is a diagram for explaining an example in which the soiu-ce program in 
Figure 3 is converted into intermediate codes. 

1 . . . source program; 2 . . . encoding processing means; 3 . . . reserved symbol table; 4 . . . variable 
symbol table; and 5 . . . intermediate code. 
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Figure 1. Block diagram illustrating the concept of the present invention 



Key: 1 Source program 

2 Encoding processing means 

3 Reserved symbol table 

4 Variable symbol table 

5 Intermediate code 
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Figure 2. Flow chart showing an example of the encoding processing means 



Key: 1 Obtain the next word 

2 Is it a line number? 

3 Is it an operator? 

4 Is it a reserved word? 

5 Is it a numerical value constant? 

6 Is it a character string constant? 

7 Register it into the symbol table 

8 Output word number (= 1) and line number 

9 Output corresponding word number 

1 0 Output word number (=11) and numerical value 

1 1 Output word number (= 12), character length, and character string 

12 Output word number (= 10) and registration location number 
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Figure 3. Example of the source program 
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Figure 4. Diagram for explaining intermediate codes 



Key: 1 Line number 

2 Variable 

3 Operator 

4 Numerical value constant 

5 Reserved word 

6 Variable symbol table 

7 Symbol 
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